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Abstract. A finite Sturmian word w over the alphabet {a, b} is left 
special (resp. right special) if aw and bw (resp. wa and wb) are both 
£H Sturmian words. A bispecial Sturmian word is a Sturmian word that 

^■j is both left and right special. We show as a main result that bispecial 

5 Sturmian words are exactly the maximal internal factors of Christoffel 

qq words, that are words coding the digital approximations of segments in 

the Euclidean plane. This result is an extension of the known relation 
between central words and primitive Christoffel words. Our characteri- 
zation allows us to give an enumerative formula for bispecial Sturmian 
h— J words. We also investigate the minimal forbidden words for the set of 

f-H Sturmian words. 

Keywords: Sturmian words, Christoffel words, special factors, minimal forbid- 
^j- den words, enumerative formula. 
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1 Introduction 

Sturmian words are non-periodic infinite words of minimal factor complexity. 
They are characterized by the property of having exactly n + 1 distinct factors of 
length n for every n > (and therefore are binary words) |16 . As an immediate 
consequence of this property one has that in any Sturmian word there is a unique 

. . factor for each length n that can be extended to the right with both letters into 

a factor of length n + 1. These factors are called right special factors. Moreover, 
since any Sturmian word is recurrent (every factor appears infinitely often) there 

^ is a unique factor for each length n that is left special, i.e., can be extended to 

the left with both letters into a factor of length n + 1. 

The set St of finite factors of Sturmian words coincides with the set of binary 
balanced words, i.e., binary words having the property that any two factors of 
the same length have the same number of occurrences of each letter up to one. 
These words are also called (finite) Sturmian words and have been extensively 
studied because of their relevant role in several fields of theoretical computer 
science. 

If one considers extendibility within the set St, one can define left special 
Sturmian words (resp. right special Sturmian words) M as those words w over 
the alphabet £ = {a, b} such that aw and bw (resp. wa and wb) are both 



Sturmian words. For example, the word aab is left special since aaab and baab 
are both Sturmian words, but is not right special since aabb is not a Sturmian 
word. 

Left special Sturmian words are precisely the binary words having suffix 
automaton^] with minimal state complexity (cf. 11,17). From combinatorial 
considerations one has that right special Sturmian words are the reversals of left 
special Sturmian words. 

The Sturmian words that are both left special and right special are called bis- 
pecial Sturmian words. They are of two kinds: strictly bispecial Sturmian words, 
that are the words w such that awa, awb, bwa and bwb are all Sturmian words, 
or non-strictly bispecial Sturmian words otherwise. Strictly bispecial Sturmian 
words have been deeply studied (see for example [5j[9] ) because they play a cen- 
tral role in the theory of Sturmian words. They are also called central words. 
Non-strictly bispecial Sturmian words, instead, received less attention. 

One important field in which Sturmian words arise naturally is discrete geom- 
etry. Indeed, Sturmian words can be viewed as digital approximations of straight 
lines in the Euclidean plane. It is known that given a point (p, q) in the discrete 
plane Z x Z, with p, q > 0, there exists a unique path that approximates from 
below (resp. from above) the segment joining the origin (0, 0) to the point (p, q). 
This path, represented as a concatenation of horizontal and vertical unitary seg- 
ments, is called the lower (resp. upper) Christoffel word associated to the pair 
(p, q). If one encodes horizontal and vertical unitary segments with the letters 
a and b respectively, a lower (resp. upper) Christoffel word is always a word of 
the form awb (resp. bwa), for some w; G Z 1 *. If (and only if) p and q are co- 
prime, the associated Christoffel word is primitive (that is, it is not the power 
of a shorter word). It is known that a word w is a strictly bispecial Sturmian 
word if and only if awb is a primitive lower Christoffel word (or, equivalently, 
if and only if bwa is a primitive upper Christoffel word). As a main result of 
this paper, we show that this correspondence holds in general between bispecial 
Sturmian words and Christoffel words. That is, we prove (in Theorem |5| that w 
is a bispecial Sturmian word if and only if there exist letters x,y in {a, b] such 
that xwy is a Christoffel word. 

This characterization allows us to prove an enumerative formula for bispecial 
Sturmian words (Corollary [I]): there are exactly 2n + 2 — (j)(n + 2) bispecial 
Sturmian words of length n, where <ft is the Euler totient function, i.e., 4>{n) is 
the number of positive integers smaller than or equal to n and coprime with 
n. It is worth noticing that enumerative formulae for left special, right special 
and strictly bispecial Sturmian words were known 9 , but to the best of our 
knowledge we exhibit the first proof of an enumerative formula for non-strictly 
bispecial (and therefore for bispecial) Sturmian words. 

We then investigate minimal forbidden words for the set St of finite Sturmian 
words. The set of minimal forbidden words of a factorial language L is the set of 
words of minimal length that do not belong to L 115]. Minimal forbidden words 



1 The suffix automaton of a finite word w is the minimal deterministic finite state 
automaton accepting the language of the suffixes of w. 



represent a powerful tool to investigate the structure of a factorial language 
(see [1 1). We give a characterization of minimal forbidden words for the set of 
Sturmian words in Theorem [3J We show that they are the words of the form ywx 
such that xwy is a non-primitive Christoffel word, where {x,y} = {a, b}. This 
characterization allows us to give an enumerative formula for the set of minimal 
forbidden words ( Corollary [2|: there are exactly 2(n— 1— 0(n)) minimal forbidden 
words of length n for every n > 1 . 

The paper is organized as follows. In Sec. [2] we recall standard definitions 
on words and factors. In Sec. [3] we deal with Sturmian words and Christoffel 
words, and present our main result. In Sec. [4] we give an enumerative formula for 
bispecial Sturmian words. Finally, in Sec. [5j we investigate minimal forbidden 
words for the language of finite Sturmian words. 



2 Words and special factors 



Let E be a finite alphabet, whose elements are called letters. A word over E is 
a finite sequence of letters from E. A right-infinite word over E is a non-ending 
sequence of letters from E. The set of all words over E is denoted by £*. The 
set of all words over E having length n is denoted by E n . The empty word has 
length zero and is denoted by e. For a subset X of Z 1 *, we note X(n) = \Xf)E n \. 
Given a non-empty word w, we let w[i] denote its z-th letter. The reversal of 
the word w = w[l]w[2] • • • w[n], with w[i] G E for 1 < i < n, is the word 
w = w[n]w[n — 1] • • • w[l]. We set e = e. A palindrome is a word w such that 
w = w. A word is called a power if it is the concatenation of copies of another 
word; otherwise it is called primitive. For a letter a G E, \w\ a is the number of 
a's appearing in w. A word w has period p, with < p < \w\, if w[i) = w[i + p] 
for every i = l,...,\w\ — p. Since \w\ is always a period of w, every non-empty 
word has at least one period. 

A word z is a factor of a word w if w = uzv for some u, v G E* . In the 
special case u = e (resp. v = e), we call z a prefix (resp. a suffix) of w. We let 
Pref(w), Suff(w) and Fact(w) denote, respectively, the set of prefixes, suffixes 
and factors of the word w. The factor complexity of a word w is the integer 
function f w (n) = \Fact(w) H E n \, n > 0. 

A factor u of a word w is left special (resp. right special) in w if there exist 
a, b G E, a =^ 6, such that au, bu G Fact(w) (resp. ua, ub G Fact(w)). A factor u of 
w is bispecial in w if it is both left and right special. In the case when E — {a, 6}, 
a bispecial factor u of w is said to be strictly bispecial in w if aua, aub, bua, bub 
are all factors of w; otherwise u is said to be non-strict ly bispecial in w. For 
example, let w = aababba. The left special factors of w are e, a, aft, b and ba. 
The right special factors of w are £, a, a6 and 6. Therefore, the bispecial factors 
of w are £, a, a6 and 6. Among these, only e is strictly bispecial. 

In the rest of the paper we fix the alphabet E = {a, b}. 



3 Sturmian words and Christoffel words 



A right-infinite word w is called a Sturmian word if f w (n) = n+1 for every n > 0, 
that is, if w contains exactly n + 1 distinct factors of length n for every n > 0. 
Sturmian words are non-periodic infinite words of minimal factor complexity 6 . 
A famous example of infinite Sturmian word is the Fibonacci word 

F = abaababaabaababaababaabaababaabaab • • • 

obtained as the limit of the substitution a \-> ab, b \-> a. 

A finite word is called Sturmian if it is a factor of an infinite Sturmian word. 
Finite Sturmian words are characterized by the following balance property [To] : 
a finite word w over £ = {a, b} is Sturmian if and only if for any u, v G Fact(w) 
such that \u\ = \v\ one has ||ix| a — \v\ a \ < 1 (or, equivalently, \\u\b — \v\b\ < 1). 
We let St denote the set of finite Sturmian words. The language St is factorial 
(if w = uv G St, then u, v G St) and extendible (for every w G St there exist 
letters x, y G U such that xwy G St). 

Let w be a finite Sturmian word. The following definitions are in [9]. 

Definition 1. A word w G E* is a left special (resp. right special) Sturmian 
word if aw,bw G St (resp. if wa,wb G St). A bispecial Sturmian word is a 
Sturmian word that is both left special and right special. Moreover, a bispecial 
Sturmian word is strictly bispecial if awa, awb, bwa and bwb are all Sturmian 
word; otherwise it is non-strictly bispecial. 

We let L5, RS, BS, SBS and NBS denote, respectively, the sets of left spe- 
cial, right special, bispecial, strictly bispecial and non-strictly bispecial Sturmian 
words. Hence, BS= LSf)RS= SBS U NBS. 

The following lemma is a reformulation of a result of de Luca [8] . 

Lemma 1. Let w be a word over E. Then w G LS (resp. w G RS) if and only 
if w is a prefix (resp. a suffix) of a word in SBS. 

Given a bispecial Sturmian word, the simplest criterion to determine if it is 
strictly or non-strictly bispecial is provided by the following nice characterization 

|9: 

Proposition 1. A bispecial Sturmian word is strictly bispecial if and only if it 
is a palindrome. 

Using the results in ^1 , one can derive the following classification of Sturmian 
words with respect to their extendibility. 

Proposition 2. Let w be a Sturmian word. Then: 

— \UwU D St\ = 4 if and only if w is strictly bispecial; 

— \EwU D St\ = 3 if and only if w is non-strictly bispecial; 

— \£w£P\St\ =2 if and only ifw is left special or right special but not bispecial; 

— \EwE n St\ = 1 if and only if w is neither left special nor right special. 



Example 1. The word w = aba is a strictly bispecial Sturmian word, since awa, 
awb, bwa and bwb are all Sturmian words, so that \UwUC\St\ = 4. The word w — 
abab is a bispecial Sturmian word since wa, wb, aw and bw are Sturmian words. 
Nevertheless, awb is not Sturmian, since it contains aa and bb as factors. So w 
is a non-strictly bispecial Sturmian word, and \EwE n St\ = 3. The Sturmian 
word w = aab is left special but not right special, and \2JwU n St\ = 2. Finally, 
the Sturmian word w — baab is neither left special nor right special, the only 
word in Ew U D St being aw a. 

We now recall the definition of central word [9]. 

Definition 2. ^4 word over £ is central if it has two coprime periods p and q 
and length equal to p + q — 2. 

A combinatorial characterization of central words is the following (see [8]): 

Proposition 3. A word w over U is central if and only if w is the power of 
a single letter or there exist palindromes P, Q such that w = PxyQ = QyxP , 
for different letters x,y G X 1 . Moreover, if |P| < \Q\, then Q is the longest 
palindromic suffix of w. 

Actually, in the statement of Proposition [3J the requirement that the words 
P and Q are palindromes is not even necessary [5]. 
We have the following remarkable result [9]: 

Proposition 4. A word over E is a strictly bispecial Sturmian word if and only 
if it is a central word. 

Another class of finite words, strictly related to the previous ones, is that of 
Christoffel words. 

Definition 3. Let n > 1 andp,q > be integers such thatp-\-q = n. The lower 
Christoffel word w PA is the word defined for 1 < i < n by 



Example 2. Let p = 6 and q = 4. We have {iA mod(10) | i = 0, 1, . . . , 10} = 
{0, 4, 8, 2, 6, 0, 4, 8, 2, 6, 0}. Hence, wq^ = aababaabab. 

Notice that for every n > 1, there are exactly n — 1 lower Christoffel words 
w Py q, corresponding to the n — 1 pairs (p, q) such that p, q > and p + q = n. 

Remark 1. In the literature, Christoffel words are often defined with the ad- 
ditional requirement that gcd(p, q) = 1 (cf. 3 ). We call such Christoffel words 
primitive, since a Christoffel word is a primitive word if and only if gcd(p, q) = 1. 




a if iq mod(n) > (i — l)q mod(ri), 
b if iq mod(n) < (i — l)q mod(n). 
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Fig. 1. The lower Christoffel word W6,4 
word u>6 )4 = babaababaa (right). 
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= aababaabab (left) and the upper Christoffel 



If one draws a word in the discrete grid Z x Z by encoding each a with a 
horizontal unitary segment and each b with a vertical unitary segment, the lower 
Christoffel word w PjQ is in fact the best grid approximation from below of the 
segment joining (0, 0) to (p, g), and has slope q/p, that is, \w\ a = p and \w\b = q 
(see Fig.[l]). 

Analogously, one can define the upper Christoffel word w' pq by 
I a if ip mod(n) < (i — l)p mod(n), 

W p,qW = < 

I & if ip mod(n) > (i — l)p mod(n). 

Of course, the upper Christoffel word w' p q is the best grid approximation from 
above of the segment joining (0,0) to (p, q) (see Fig. [T]). 

Example 3. Let p = 6 and q = 4. We have {i6 mod(10) | i = 0, 1, . . . , 10} = 
{0, 6, 2, 8, 4, 0, 6, 2, 8, 4, 0}. Hence, w' Q A = babaababaa. 

The next result follows from elementary geometrical considerations. 

Lemma 2. For every pair (p,q) the word w' p q is the reversal of the word w Pjq . 

If (and only if) p and q are coprime, the Christoffel word w Pjq intersects 
the segment joining (0, 0) to (p, q) only at the end points, and is a primitive 
word. Moreover, one can prove that w p ^ q = aub and w' = bua for a palindrome 
u. Since u is a bispecial Sturmian word and it is a palindrome, u is a strictly 
bispecial Sturmian word (by Proposition[T]). Conversely, given a strictly bispecial 
Sturmian word ii, u is a central word (by Proposition [4| , and therefore has two 
coprime periods p, q and length equal to p + q — 2. Indeed, it can be proved that 
aub = w Pj q and bua — w' p q . The previous properties can be summarized in the 
following theorem (cf. [2]): 

Theorem 1. SBS = {w | xwy is a primitive Christoffel word, x,y G U}. 



If instead p and q are not coprime, then there exist coprime integers p' , q' such 
that p = rp' , q = rq' , for an integer r > 1. In this case, we have w PjQ = (w p ^ q ') r 
that is, is a power of a primitive Christoffel word. Hence, there exists a 
central Sturmian word u such that w VA = (aub) r and k;' = (bua) r . So, we 
have: 

Lemma 3. The word xwy, x ^ y e £, is a Christoffel word if and only if 
w = (uyx) n u, for an integer n > and a central word u. Moreover, xwy is a 
primitive Christoffel word if and only if n = 0. 

Recall from M that the right (resp. left) palindromic closure of a word w 
is the (unique) shortest palindrome (resp. w^) such that w is a prefix 

of (resp. a suffix of w^). If w = uv and v is the longest palindromic 

suffix of w (resp. u is the longest palindromic prefix of w), then = wu 

(resp. vj(~^ — vw). 

Lemma 4. Let xwy be a Christoffel word, x,y G ZJ. Then w^ and w^ are 
central words. 

Proof. Let xwy be a Christoffel word, x,y £ U. By Lemma |3j w — (uyx) n u, 
for an integer n > and a central word u. We prove the claim for the right 
palindromic closure, the claim for the left palindromic closure will follow by 
symmetry. If n = 0, then w = u : so w is a palindrome and then w^ = w is a 
central word. So suppose n > 0. We first consider the case when u is the power 
of a single letter (including the case u = e). We have that either w = (y k+1 x) n y k 
or w = (x k yx) n x k for some k > 0. In the first case, w^ = wy = (y fe+1 x) n y /c+1 , 
whereas in the second case w^ = wyx k = (x k yx) n x k yx k . In both cases one 
has that w^ is a strictly bispecial Sturmian word, and thus, by Proposition [4J 
a central word. 

Let now u be not the power of a single letter. Hence, by Proposition [3j there 
exist palindromes P, Q such that u = PxyQ = QyxP. Now, observe that 

w = (uyx) n u — Pxy(QyxPxy) n Q 

We claim that the longest palindromic suffix of w is (QyxPxy) n Q. Indeed, the 
longest palindromic suffix of w cannot be w itself since w is not a palindrome, so 
since any palindromic suffix of w longer than (QyxPxy) n Q must start in u, in 
order to prove the claim it is enough to show that the first non-prefix occurrence 
of u in w is that appearing as a prefix of (QyxPxy) n Q. Now, since the prefix 
v = PxyQyxP of w can be written as v = uyxP = Pxyu, one has by Proposition 
[3] that v is a central word. It is easy to prove (see, for example, [4]) that the 
longest palindromic suffix of a central word does not have internal occurrences, 
that is, appears in the central word only as a prefix and as a suffix. Therefore, 
since \u\ > |P|, u is the longest palindromic suffix of v (by Proposition [3|, and 
so appears in v only as a prefix and as a suffix. This shows that (QyxPxy) n Q is 
the longest palindromic suffix of w. 



Thus, we have = wyxP, and we can write: 

w (+) — Pxy(QyxPxy) n QyxP 

= PxyQ ■ yx ■ P(xyQyxP) n 
= (PxyQyx) n P • xy • QyxP 

so that = uyxz = zxyu for the palindrome z = P(xyQyxP) n = 

(PxyQyx) n P. By Proposition [3J is a central word. □ 

We are now ready to state our main result. 

Theorem 2. BS = {w | xwy is a Christoffel word, x,y G X 1 }. 

Proof. Let xwy be a Christoffel word, x, y G Then, by Lemma |3j w is of the 
form w = (uyx) n u, n > 0, for a central word u. By Lemma [4j u> is a prefix of 
the central word and a suffix of the central word w^~\ and therefore, by 
Proposition [4] and Lemma [T] w is a bispecial Sturmian word. 

Conversely, let w be a bispecial Sturmian word, that is, suppose that the 
words xw, yw, wx and wy are all Sturmian. If w is strictly bispecial, then w is 
a central word by Proposition [4j and xwy is a (primitive) Christoffel word by 
Theorem [l] So suppose w G NBS. By Lemma [3j it is enough to prove that w is of 
the form w = (uyx) n u, n > 1, for a central word u and letters x ^ y. Since w is 
not a strictly bispecial Sturmian word, it is not a palindrome (by Proposition [I]) . 
Let u be the longest palindromic border of w (that is, the longest palindromic 
prefix of w that is also a suffix of w), so that w = uyzxu, x^ye£,ze£*. 
If z = e, w = uyxu and we are done. Otherwise, it must be z = xz'y for some 
z' G Z 1 *, since otherwise either the word yw would contain yuy and xxu as 
factors (a contradiction with the hypothesis that yw is a Sturmian word) or 
the word wx would contain uyy and xux as factors (a contradiction with the 
hypothesis that wx is a Sturmian word). 

So w — uyxz'yxu. \iu — e^ then it must be z = (yx) k for some k > 0, since 
otherwise either xx would appear as a factor in w, and therefore the word yw 
would contain xx and yy as factors, being not a Sturmian word, or yy would 
appear as a factor in w, and therefore the word wx would contain xx and yy 
as factors, being not a Sturmian word. Hence, if u = e we are done, and so we 
suppose \u\ > 0. 

By contradiction, suppose that w is not of the form w = (uyx) n u. That is, 
let w = (uyx) k u'av, with k > 1, v G Z 1 *, u'b G Prej{uyx\ for different letters a 
and b. If \u'\ > \u\, then either \u'\ — \u\ or \u!\ = \u\ + 1. In the first case, u' = u 
and w = (uyx) k uxv' ', for some v' G Z 1 *, and then the word yw would contain 
yuy and xux as factors, being not a Sturmian word. In the second case, u' = uy 
and w = (uyx) k uyyv h for some G Z 1 *; since xii is a suffix of w, and therefore 
w = {uyx) k v'" xu for some v'" G Z 1 *, we would have that the word wx contains 
both uyy and xux as factors, being not a Sturmian word. Thus, we can suppose 
u'b G Pref(u). Now, if a = x and b = y, then the word i/t^ would contain the 
factors yu'y and xu'x, being not a Sturmian word; if instead a = y and b = 
let ii = u'xu" , so that we can write u> = (uyx) k u'yv = (uyx) k ~ x u' xu" yxu' yv. 



The word wx would therefore contain the factors u"yxu'y and xux = xu'xu"x 
(since xu is a suffix of w), being not a Sturmian word (see Fig. [2|. In all the 
cases we obtain a contradiction and the proof is thus complete. □ 
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Fig. 2. The proof of Theorem [2] 



So, bispecial Sturmian words are the maximal internal factors of Christoffel 
words. Every bispecial Sturmian word is therefore of the form w = (uyx) n u, 
n > 0, for different letters x, y and a central word u. The word w is strictly 
bispecial if and only if n = 0. If n = 1, w is a semicentral word El, that is, a 
word in which the longest repeated prefix, the longest repeated suffix, the longest 
left special factor and the longest right special factor all coincide. 

4 Enumeration of bispecial Sturmian words 

In this section we give an enumerative formula for bispecial Sturmian words. It 
is known that the number of Sturmian words of length n is given by 

n 

St(n) = 1 + J2( n ~ i + 
i=i 

where <p is the Euler totient function, i.e., <p(n) is the number of positive integers 



smaller than or equal to n and coprime with n (cf. 13 14 ). 

Let w be a Sturmian word of length n. If w is left special, then aw and bw 
are Sturmian words of length n + 1. If instead w if not left special, then only 
one between aw and bw is a Sturmian word of length n + 1. Therefore, we have 
LS(n) = St(n + 1) - St(n) and hence 

n+l 

LS(n) = J2<P(i) 

i=l 

Using a symmetric argument, one has that also 

n+l 

RS(n) = J2<f>(i) 

i=l 



Since 9 SBS(n) = LS(n + 1) - LS{n) = RS(n + 1) - i?5(n), we have 



SBS(n) = <j)(n + 2) 

Therefore, in order to find an enumerative formula for bispecial Sturmian 
words, we only have to enumerate the non- strictly bispecial Sturmian words. We 
do this in the next proposition. 

Proposition 5. For every n > 1, one has 

NBS(n) = 2 (n + 1 - <t>{n + 2)) 

Proof. Let 

W n = {w | awb is a lower Christoffel word of length n + 2} 

and 

= {w r | 6k/ a is an upper Christoffel word of length n + 2} 
By Theorem [2j the bispecial Sturmian words of length n are the words in W n U 

Ti 

Among the n + 1 words in W n , there are 0(n + 2) strictly bispecial Sturmian 
words, that are precisely the palindromes in W n . The n+1 — </>(n+2) words in W n 
that are not palindromes are non- strictly bispecial Sturmian words. The other 
non-strictly bispecial Sturmian words of length n are the n + 1 — (j)(n + 2) words 
in that are not palindromes. Since the words in W' n are the reversals of the 
words in W n , and since no non-strictly bispecial Sturmian word is a palindrome 
by Proposition [l] there are a total of 2(n + 1 — <p(n + 2)) non-strictly bispecial 
Sturmian words of length n. □ 

Corollary 1. For every n>0, there are 2(n + 1) — <p(n + 2) bispecial Sturmian 
words of length n. 

Example 4- The Christoffel words of length 12 and their maximal internal fac- 
tors, the bispecial Sturmian words of length 10, are reported in Table [l] 

5 Minimal forbidden words 

Given a factorial language L (that is, a language containing all the factors of its 
words) over the alphabet E, a word v G is a minimal forbidden word for L if v 
does not belong to L but every proper factor of v does (see [7] for further details). 
Minimal forbidden words represent a powerful tool to investigate the structure 
of a factorial language (cf. 1 ). In the next theorem, we give a characterization 
of the set MFSt of minimal forbidden words for the language St. 

Theorem 3. MFSt = {ywx | xwy is a non-primitive Christoffel word, x,y G U}. 
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aababaababab 


bababaababaa 


(6,6) 


abababababab 


babababababa 


(5,7) 


abababbababb 


bbababbababa 


(4,8) 


abbabbabbabb 


bbabbabbabba 


(3,9) 


abbbabbbabbb 


bbbabbbabbba 


(2,10) 


abbbbbabbbbb 


bbbbbabbbbba 


(1,11) 


abbbbbbbbbbb 


bbbbbbbbbbba 



Table 1. The Christoffel words of length 12. Their maximal internal factors are 
the bispecial Sturmian words of length 10. There are 4 = 0(12) strictly bispe- 
cial Sturmian words, that are the palindromes aaaaaaaaaa, ababaababa, bababbabab 
and bbbbbbbbbb (underlined), and 14 = 2(11 — 4) non-strictly bispecial Sturmian 
words: aaaaabaaaa, aaaabaaaaa, aaabaaabaa, aabaaabaaa, aabaabaaba, abaabaabaa, 
ababababab, bababababa, babbabbabb, bbabbabbab, bbabbbabbb, bbbabbbabb, bbbbabbbbb and 
bbbbbabbbb. 



Proof. If xwy is a non-primitive Christoffel word, then by Theorems [T] and |2j w is 
a non-strictly bispecial Sturmian word. This implies that ywx is not a Sturmian 
word, since a word w such that xwy and ywx are both Sturmian is a central 
word (9], and therefore a strictly bispecial Sturmian word (Proposition [4|. Since 
yw and wx are Sturmian words, we have ywx G MFSt. 

Conversely, let ywx G MFSt. By definition, yw is Sturmian, and therefore 
ywy must be a Sturmian word since St is an extendible language. Analogously, 
since wx is Sturmian, the word xwx must be a Sturmian word. Thus, w is 
a bispecial Sturmian word, and since ywx £ St, w is a non-strictly bispecial 
Sturmian word. By Theorems [T] and [2j xwy is a non-primitive Christoffel word. 

□ 

Corollary 2. For every n > 1, one has 

MFSt(n) = 2(n - 1 - <j>(n)) 



It is known from 14 that St(n) = 0(n 3 ), as a consequence of the estimation 
(see 12 , p. 268) 

y>W = — +0(n log n) (1) 

i=l 



From ([!]) and from the formula of Corollary [2j we have that 

n 

^MFSt(n) = 0(n 2 ) 
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